Improved Automatic Bird Identification through Decision Tree based Feature Selection and Bagging
نویسنده
چکیده
This paper presents a machine learning technique for bird species identification at large scale. It automatically identifies about a thousand different species in a large number of audio recordings and provides the basis for the winning solution to the LifeCLEF 2015 Bird Identification Task. To process the very large amounts of audio data and to achieve similar good results compared to previous identification challenges new methods e.g. downsampling of spectrogram images for faster feature extraction, advanced feature selection via decision tree based feature ranking and bootstrap aggregating using averaging and blending were tested and evaluated.
منابع مشابه
Improving Bird Identification using Multiresolution Template Matching and Feature Selection during Training
This working note describes methods to automatically identify a large number of different bird species by their songs and calls. It focuses primarily on new techniques introduced for this year’s task like advanced spectrogram segmentation and decision tree based feature selection during training. Considering the identification of dominant species, previous results of the LifeCLEF Bird Identific...
متن کاملUsing Feature Selection with Bagging and Rule Extraction in Drug Discovery∗
This paper investigates different ways of combining feature selection with bagging and rule extraction in predictive modeling. Experiments on a large number of data sets from the medicinal chemistry domain, using standard algorithms implemented in the Weka data mining workbench, show that feature selection can lead to significantly improved predictive performance. When combining feature selecti...
متن کاملOpinion Mining Using Decision Tree Based Feature Selection through Manhattan Hierarchical Cluster Measure
Opinion mining plays a major role in text mining applications in consumer attitude detection, brand and product positioning, customer relationship management, and market research. These applications led to a new generation of companies and products meant for online market perception, reputation management and online content monitoring. Subjectivity and sentiment analysis focus on private states...
متن کاملUsing Data Mining Models for Differential Diagnosis of Iron Deficiency Anemia and β-thalassemia Minor
Introduction: One of the most common types of anemia is Iron deficiency anemia that its main differential diagnosis is β-thalassemia minor. The rapid and accurate screening of β-thalassemia minor has particular importance for pre-marriage medical counseling and the prevention of the birth of neonates with β-thalassemia major and differentiating it from iron deficiency anemia to avoid unnecessar...
متن کاملUsing Data Mining Models for Differential Diagnosis of Iron Deficiency Anemia and β-thalassemia Minor
Introduction: One of the most common types of anemia is Iron deficiency anemia that its main differential diagnosis is β-thalassemia minor. The rapid and accurate screening of β-thalassemia minor has particular importance for pre-marriage medical counseling and the prevention of the birth of neonates with β-thalassemia major and differentiating it from iron deficiency anemia to avoid unnecessar...
متن کامل